What is claimed is:
1. A computerized storage and retrieval system of biological information, comprising:
a data entry means;

a display means;

a programmable central processing unit; and

a data storage means having cDNA sequences and related information electronically stored in
a relational database;

wherein the stored sequences are annotated and organized in a curated functional clustering
arrangement.

2. The computerized storage and retrieval system of claim 1, wherein the central
processing unit is programmed with the ability to calculate significance values, perform gene
annotation analysis, generate transcript images, perform transcript image analysis, perform
subtractive analysis, perform electronic Northern analysis, or perform electronic commonality
analysis.

3. The computerized storage and retrieval system of claim 1, wherein the central
processing unit is programmed to perform automated bioanalysis on the stored cDNA sequences.

4. The computerized storage and retrieval system of claim 3, wherein the automated
bioanalysis comprises:

sequence editing; and
annotation and organization of sequences.

5. The computerized storage and retrieval system of claim 4, wherein the sequence
editing comprises the steps of:

identifying and clipping vector sequences;
identifying and clipping unwanted functional motifs;
identifying and removing cloning and sequencing artifacts; and
identifying and masking low information sequences.
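
As an illustration only, the four editing steps of claim 5 might be sketched in Python as follows. The vector sequence, the poly-A motif, the minimum-length cutoff, and the homopolymer masking rule are all hypothetical stand-ins chosen for this example; the claims do not specify any of them.

```python
import re

# Hypothetical inputs: the claims do not name a vector, motif, or threshold.
VECTOR_PREFIX = "GAATTCGGCACGAG"          # assumed vector/adapter sequence
UNWANTED_MOTIF = re.compile(r"A{12,}$")   # assumed motif: trailing poly-A tail
MIN_LENGTH = 20                           # assumed cutoff for artifact removal
LOW_INFO = re.compile(r"A{8,}|C{8,}|G{8,}|T{8,}")  # assumed low-information runs


def edit_sequence(raw):
    """Return the edited sequence, or None if it is discarded as an artifact."""
    seq = raw.upper()
    # Step 1: identify and clip vector sequence at the 5' end.
    if seq.startswith(VECTOR_PREFIX):
        seq = seq[len(VECTOR_PREFIX):]
    # Step 2: identify and clip an unwanted motif (here, a 3' poly-A tail).
    seq = UNWANTED_MOTIF.sub("", seq)
    # Step 3: remove artifacts (here, reads that are too short or that
    # contain more than 10% ambiguous bases).
    if len(seq) < MIN_LENGTH or seq.count("N") > len(seq) // 10:
        return None
    # Step 4: mask low-information homopolymer runs with 'N' of equal length.
    return LOW_INFO.sub(lambda m: "N" * len(m.group(0)), seq)
```

Masking with a same-length run of `N` keeps downstream coordinates unchanged, which is one reason to mask rather than delete low-information regions.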

6. The computerized storage and retrieval system of claim 4, wherein the automated
bioanalysis further comprises the steps of:

transcript extension; and
transcript expansion.

7. The computerized storage and retrieval system of claim 1, wherein the stored cDNA
sequences are comprised of SEQ ID NOS. 1-10.

8. The computerized storage and retrieval system of claim 1, wherein the information
pertaining to the cDNA sequences is stored in a plurality of tables, said tables organized into
categories.

9. The computerized storage and retrieval system of claim 8, wherein the categories
comprise library preparation, clone preparation, sequencing, sequencing equipment, sequencing
reagents, function identification, and express sets.
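
One way to picture tables organized into categories is a small relational schema. The sketch below uses Python's built-in `sqlite3` module; every table name, column name, and row is hypothetical, and only two of the listed categories (library preparation and clone preparation) are represented. A lookup by source tissue is included as an example of searching the stored information.

```python
import sqlite3

# All table and column names are hypothetical; the claims name only categories.
SCHEMA = """
CREATE TABLE library (            -- category: library preparation
    library_id INTEGER PRIMARY KEY,
    tissue     TEXT NOT NULL,
    pathology  TEXT
);
CREATE TABLE clone (              -- category: clone preparation
    clone_id   INTEGER PRIMARY KEY,
    library_id INTEGER NOT NULL REFERENCES library(library_id),
    sequence   TEXT NOT NULL
);
"""


def build_demo_db():
    """Create an in-memory database holding a few made-up rows."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    conn.executemany("INSERT INTO library VALUES (?, ?, ?)",
                     [(1, "liver", "normal"), (2, "lung", "tumor")])
    conn.executemany("INSERT INTO clone VALUES (?, ?, ?)",
                     [(10, 1, "ACGT"), (11, 2, "GGCC"), (12, 2, "TTAA")])
    return conn


def clones_from_tissue(conn, tissue):
    """Return the ids of clones whose source library came from `tissue`."""
    rows = conn.execute(
        "SELECT c.clone_id FROM clone AS c "
        "JOIN library AS l ON l.library_id = c.library_id "
        "WHERE l.tissue = ? ORDER BY c.clone_id", (tissue,))
    return [row[0] for row in rows]
```

Keeping source information in its own table means a tissue, organ, or pathology query is a single join rather than a scan of per-clone free text.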

10. The computerized storage and retrieval system of claim 1, wherein the storage means
can be searched to determine source tissue information, to determine source organ information, to
determine source pathology information, or to determine source patient information.

11. A method for quantifying the relative abundance of mRNA species in a sample, said
method comprising the steps of:

generating cDNA sequences corresponding to a representative population of transcripts found
within a sample;

organizing cDNA sequences into a functional clustering arrangement;

accessing a computerized storage and retrieval system of biological information containing
reference cDNA sequence data corresponding to full-length reference transcripts annotated and
stored in a curated functional clustering arrangement;

processing the sample sequence data and the reference sequence data in a programmed
computer to generate an identified sequence value for each of the transcripts, said sequence
value being indicative of a sequence annotation and a degree of match between a transcript from
the sample sequence data and at least one transcript from the reference sequence data; and

processing each identified sequence value to generate final data values indicative of a number
of times each identified sample sequence value is present within the curated reference sequences.

12. The method of claim 11, wherein the reference sequence data is comprised of SEQ
ID NOS. 1-10.
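
The matching and counting steps of claim 11 can be illustrated with a deliberately simplified Python sketch. An exact-substring test stands in for whatever similarity search and degree-of-match score an actual system would use, and the reference names and sequences are invented for the example.

```python
from collections import Counter


def identify(sample_seq, reference):
    """Return the name of the first reference transcript containing sample_seq.

    Exact substring matching is an assumption made for brevity; it stands in
    for a real sequence-similarity search with a degree-of-match score.
    """
    for name, ref_seq in reference.items():
        if sample_seq in ref_seq:
            return name
    return None  # no reference match


def relative_abundance(sample_seqs, reference):
    """Return, per identified transcript, its fraction of all sample sequences."""
    if not sample_seqs:
        return {}
    counts = Counter(identify(seq, reference) for seq in sample_seqs)
    total = len(sample_seqs)
    return {name: n / total for name, n in counts.items()}
```

Sequences that match no reference transcript are tallied under the key `None` rather than dropped, so the fractions always sum to one.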



13. The method of claim 11, wherein the method is used for transcript discovery.

14. A method for transcript image analysis comparing two or more samples, comprising
the steps of:

producing a first transcript image from the cDNA sequences corresponding to a
representative population of the full-length transcripts of a first sample;
producing a second transcript image from the cDNA sequences corresponding to a
representative population of full-length transcripts of a second sample; and

determining the ratio of the frequency that a cDNA sequence appears in the first and the
second source tissues;

wherein the ratio of the frequency that a cDNA sequence appears in the first and the second
samples is indicative of the relative level of expression of the corresponding transcripts in each
sample.
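
As a rough sketch of the frequency ratio recited in claim 14, a transcript image can be modeled as a mapping from transcript identifier to the fraction of sequenced clones assigned to it. The data layout and function names below are assumptions made for illustration.

```python
from collections import Counter


def transcript_image(clone_ids):
    """Map each transcript identifier to its frequency among sequenced clones."""
    counts = Counter(clone_ids)
    total = sum(counts.values())
    return {t: n / total for t, n in counts.items()}


def frequency_ratios(image_1, image_2):
    """Ratio of first-sample to second-sample frequency for shared transcripts.

    Transcripts missing from either image are skipped here to avoid dividing
    by zero; how to treat them is a design choice the claim leaves open.
    """
    return {t: image_1[t] / image_2[t] for t in image_1 if t in image_2}
```

A ratio above one suggests relatively higher expression in the first sample, and a ratio below one suggests the opposite.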

15. The method of claim 14 wherein said second sample is a stored reference set of
cDNA sequences.

16. The method of claim 14, wherein the first sample is from normal tissue and the
second sample is from a diseased or potentially diseased sample.

17. The method of claim 14, wherein the method is used for transcript discovery.

18. A method for performing electronic Northerns comprising the steps of: 
selecting libraries corresponding to samples of interest; 

selecting a cDNA sequence to examine in each selected library; and 
performing abundance analysis for said cDNA sequence in each library; 
wherein the abundance of cDNA sequence in each library is indicative of the location,
distribution, and relative abundance of gene expression in the selected samples.

19. The method of claim 18, wherein the gene expression is determined by the abundance
of a cDNA sequence selected from the group consisting of: SEQ ID NO:1, SEQ ID NO:2, SEQ
ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ
ID NO:9, and SEQ ID NO:10.
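
The abundance analysis recited in claim 18 can be pictured as counting how often one chosen sequence occurs in each selected library. In this minimal sketch, the in-memory layout (library name mapped to a list of transcript identifiers) and the identifiers themselves are hypothetical.

```python
def electronic_northern(libraries, target):
    """Return the relative abundance of `target` in each selected library.

    `libraries` maps a library name to the list of transcript identifiers
    assigned to its sequenced clones (a hypothetical in-memory layout).
    """
    result = {}
    for name, clones in libraries.items():
        # Abundance = matching clones / all clones sequenced from the library.
        result[name] = clones.count(target) / len(clones) if clones else 0.0
    return result
```

Dividing by the library size makes libraries of different sequencing depth comparable, which is what allows the per-library values to be read side by side like lanes of a blot.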



20. The method of claim 18, wherein the method is used for diagnostic purposes,
prognostic purposes, or determining patient treatment.

21. A method for performing electronic commonality analysis between a first sample and
a second sample, said method comprising the steps of:

producing a first transcript image from the cDNA sequences corresponding to a
representative population of the full-length transcripts of a first sample;
producing a second transcript image from the cDNA sequences corresponding to a
representative population of full-length transcripts of a second sample; and
electronically comparing the transcript images of the sample data set and the reference data
set to identify transcripts expressed in both of the two samples;

wherein normalized abundances are used to determine a ratio of expression between the two
samples.
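
A minimal sketch of the commonality comparison in claim 21, assuming each transcript image is already a mapping from transcript identifier to normalized abundance (fraction of clones). The function name and data layout are illustrative only.

```python
def commonality(image_1, image_2):
    """Return {transcript: expression ratio} for transcripts in both images.

    Each image maps a transcript identifier to a normalized abundance, so the
    ratio compares samples sequenced to different depths on an equal footing.
    """
    shared = image_1.keys() & image_2.keys()
    return {t: image_1[t] / image_2[t] for t in sorted(shared) if image_2[t] > 0}
```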

22. The method of claim 21 wherein said second sample is a stored reference set of
cDNA sequences.

23. The method of claim 21, wherein the first sample is from normal tissue and the
second sample is from a diseased or potentially diseased sample.

24. The method of claim 21, wherein the sample is selected from the group consisting of
blood, sputum, urine, ascites fluid, cerebrospinal fluid, and biopsy tissue. 



25. The method of claim 21, wherein the method is used for transcript discovery.

26. The method of claim 21, wherein the method is used for diagnostic purposes,
prognostic purposes, or determining patient treatment.



27. A method for performing electronic subtraction analysis between a first sample and a
second sample, said method comprising the steps of:

producing a first transcript image from the cDNA sequences corresponding to a
representative population of the full-length transcripts of a first sample;

producing a second transcript image from the cDNA sequences corresponding to a
representative population of full-length transcripts of a second sample; and
selecting a target abundance value for transcripts found in the sample sequence data and
reference sequence data;

processing this information to determine the transcripts in each sequence data set that
exceed the selected target abundance value; and

electronically comparing the transcript images of the sample data set and the reference data
set to identify transcripts expressed in only one of the two samples.
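
The thresholding and comparison steps of claim 27 might be sketched as below. Treating a transcript as "expressed" when its normalized abundance exceeds the target value is an assumption of this sketch, as are the function name and the dictionary layout of the transcript images.

```python
def electronic_subtraction(image_1, image_2, target_abundance):
    """Return transcripts above the target abundance in exactly one image.

    Each image maps a transcript identifier to a normalized abundance.
    Treating "expressed" as "abundance above the target value" is an
    assumption made for this sketch.
    """
    above_1 = {t for t, a in image_1.items() if a > target_abundance}
    above_2 = {t for t, a in image_2.items() if a > target_abundance}
    return {"only_first": sorted(above_1 - above_2),
            "only_second": sorted(above_2 - above_1)}
```

Applying the threshold before the set difference keeps rare, possibly spurious sequences from being reported as sample-specific.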



28. The method of claim 27 wherein said second sample is a stored reference set of
cDNA sequences.



29. The method of claim 27, wherein the first sample is from normal tissue and the
second sample is from a diseased or potentially diseased sample.

30. The method of claim 27, wherein the sample is selected from the group consisting of
blood, urine, sputum, ascites fluid, cerebrospinal fluid, and biopsy tissue.

31. The method of claim 27, wherein the method is used for transcript discovery. 



32. The method of claim 27, wherein the method is used for transcript discovery,
diagnostic purposes, prognostic purposes, or determining patient treatment.